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Introduction 


In formal language theory, several models characterize regular languages: finite au¬ 
tomata, congruences of finite index, monadic second-order logic (MSO), see e.g. jBiicbOl 
Elg61| lTra6lj . Further connections exist between monoid varieties and logical frag¬ 
ments of MSO, for instance aperiodic monoids have been shown to recognize exactly 
the word languages defined by first-order (FO) formulas [ Sch65. [MP71 J. 

When considering word relations instead of languages, automata are replaced by 
transducers, that is automata with outputs associated with transitions. A transducer 
is an automaton that reads an input word and returns an output word obtained by 
concatenating the outputs of the transitions. However, several important properties 
do not generalize from automata to transducers. For instance, the well known equiv¬ 
alence between deterministic and non-deterministic one-way automata, as well as the 
equivalence between one-way and two-way automata, do not transfer to transducers. 
One property that is preserved is the equivalence between automata and MSO formu¬ 
las: it has been shown li d 101 that MSO word transductions and two-way transducers 
define the same class of word functions called regular functions. A recently intro¬ 
duced model of computation, streaming string t ransd ucers (SST), has been shown 
to compute the same class of regular functions (AClOj . Two recent related results 
are the equivalence between FO transductions and aperiodic SSTs |FKT14 j and the 
equivalence between FO transductions and aperiodic two-way transducers jCDl5j . 
However it is not known if one can decide if a given regular function is FO definable. 

Our contribution: Our result deals with functions that are definable by one-way 
word transducers. These functions are known in the literature as rational functions. 

The notion of minimal automaton goes beyond minimizing the state space. In¬ 
deed to decide whether a regular language satisfies some algebraic property, like 
aperiodicity, it suffices to consider the minimal automaton. Therefore in order to 
have an algebraic characterization of rational functions, we need a notion similar to 
the one of minimal automata for transductions. For the class of functions defined by 
deterministic transducers, such a notion exists jCho03| and this minimal transducer 
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enjoys, among deterministic transducers, the same kind of minimality properties. 

In an attempt to obtain a similar notion for rational functions, we study the 
model of bimachines |Sch61j which has been shown to be a canonical model for ra¬ 
tional functions (see e.g. |BB79j ). We describe a canonical bimachine, introduced 
by [RHM] . and show that this representation preserves, similarly to the minimal 
automaton for languages, some algebraic properties of the function. In the case of 
aperiodic rational function, i.e. functions definable by an aperiodic one-way trans¬ 
ducer, a characterization has already been given in [RS95j. Our main contribution 
is to give an effective characterization of aperiodic rational functions, and extend it 
to other algebraic varieties of rational functions. We also introduce translations , a 
model of logical transductions inspired by and equivalent to bimachines. We show 
an equivalence between transducers satisfying some algebraic property and logical 
translations, for instance between aperiodic transducers and FO-translations. 
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Chapter 1 

Automata for languages and 
transductions 

1.1 Words, languages and transductions 

Let an alphabet S be a finite set of symbols called letters. Let E* denote the free 
monoid on E. An element of E* is called a word and the identity element £ is called 
the empty word. A language is a set of words. 

A transduction R is a subset of E* x E*. For (u,v) G R, u is called an input 
word and v an output word. The domain of R , denoted by dom(i?), is the set of 
input words. A transduction is called functional if it is a (partial) function. For a 
functional transduction / we write f[u) = v instead of (u,v) G /. 

Example 1.1.1. We give two running examples of functional transductions: 

• Let E = {a}. The transduction f eV en ■ E* —> E* copies the word if its length is 
even and outputs £ otherwise. 

• Let E = {a, b}. The transduction f ends : E* —> E* replaces each letter by a 
if the first and last letter are a and yields £ otherwise, e.g., f ends (abaa ) = a 4 , 
f ends {baaab ) = f en ds{abab ) = e. 

1.2 Finite state automata 

A left non-deterministic finite state automaton (NFA or simply automaton) over E 
is a tuple A = (Q, I, F, A), where Q is a finite set of states, / C Q is the set of initial 
states, F C Q is the set of accepting states, ACQxExQis the transition relation. 
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a 



a 


(a) DFA recognizing L even . 


a, b 



Figure 1.1: Two automata. 


We denote (p, a,q) G A by p A a q (we will omit the A when the context is 
clear). We extend the relation to words: p 0 pk when we have p 0 —h pi 

p 2 ■ ■ ■ pk- 1 — Pk- Such a sequence is called a run of A on a\... ak, and if po G I and 
Pk G F. it is called a successful run. We also add q A- q, Wq G Q. 

A word « G E* is recognized or accepted by A if there exist p G /, q G F such 
that p A q. The set of all words recognized by A is called the language recognized 
by A and is denoted by L(A). 

A left deterministic finite state automaton (left DFA or simply DFA) on an alpha¬ 
bet E is defined similarly to an NFA as a tuple A = (Q,qo, F,5) where the transition 
function is 5 : Q x £ —* Q and q 0 G Q is the unique initial state. 

Let A = (Q,I,F, A) be an automaton, and let P\ , F 2 C Q. Then we define 
Lp lt p 2 (A ) as the language recognized by A 1 = (Q, Pi, P 2 , A). 

Remark 1.2.1. Right automata are defined exactly as left automata except that they 
read the words from right to left. Let A = (Q, /, F. A) be a right automaton. Then we 
denote (p, a, q) G A by q <G- p and pk — p 0 when we have pk G- 5 - Pk-i • • ■ Pi p 0 - 
Unless specihed otherwise, an automaton is assumed to be a left automaton. 


Example 1.2.1. The DFA of Figure 1.1 (a) recognizes the language L 
of even length. The NFA of Figure 1.1 (b) recognizes the language L 


ends 


over E = {a, b} that start and end with an a. 


of words 
of words 


1.3 Transducers 

A nondeterministic finite state transducer (NFT) on an alphabet E is a tuple T = 
(Q, /, F, A, i, t), where Q is a finite set of states, / C Q is the set of initial states, 
F C Q is the set of accepting states, A:QxExQ-)-E*isa partial function 
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defining the transitions Q i : I —> £* is the initial function, and t : F — y £* is the 
terminal function. 

Let A (p,a,q) = v be a transition of T. The letter a and the word v are respec¬ 
tively called the input and the output of the transition. We denote A (p, a, q ) = v by 

p q (we will omit the T when the context is clear). We extend the relation to 

<Tl---<Tk\ v l i i (Til'll <J 21 V2 ri 1 

words: p 0 - > Pk when we have p G - > p i - > p 2 • • -Pk -i - > Pk • ouch 

a sequence is called a run of T on Oi... o k) and if p 0 G / and p k G F, it is called 

a successful run. We also add q q, Vg G Q. An NFT realizes a transduction 
noted [T] C E* x E*. We also say that T defines the transduction [T]. The pair 

(u,v) G [TJ if there exist qo > qf such that qo E I, qq £ F and v = i(q 0 ) w t(qq). 
An NFT is called functional if it defines a functional transduction. From now on the 
NFTs considered will be functional. 

A deterministic finite state transducer (DFT) on an alphabet £ is defined sim¬ 
ilarly to an NFT as a tuple T = (Q,qo, F,S,i,t) where the transition function is 
LQxSgQxS* and qo G Q. 

The underlying automaton of a transducer is the automaton obtained by removing 
the outputs from the transitions. Let T = (Q, /, F, A, i, t) be a transducer, and let 
ACQxExQbe the projection of A. Then At = (Q, /, F, A) is called the 
underlying automaton of T. 

Functions defined by NFTs are called rational , functions defined by DFTs are 
called subsequential and functions defined by DFTs without terminal output are 
called sequential. 


Example 

ure 


1.2 


a, 


1.3.1. We give two examples of transducers: The transducer of Fig- 
defines the function f even while the transducer of Figure 1.2 (b) defines 


the function f ends . 


lr This type of transducer is sometimes called real-time jSakQS]- In the general case, a transition 
of a transducer may be labelled by any pair of words 
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Chapter 2 


Logical and algebraic 
characterizations of classes of 
regular languages 

2.1 Congruences and monoids 

Let E be a finite alphabet and let ~ be an equivalence relation on E*. We say that 
~ is a right congruence if it verifies: 

\/u, v G E*, Va G E, m ~ r ^ we ~ va 

Symmetrically we define a left congruence. A congruence is defined as both a left 
and a right congruence. 

Let ~i and ~ 2 be two equivalence relations. We say that ~i is finer than ~ 2 , 
or that ~ 2 is coarser than ~i, denoted by ~i C ~ 2 if for all u,v we have 
u ~ 2 v. 

We recall that a monoid M is a set given with an associative binary operation 
(written multiplicatively) and an identity element which is neutral (both left and 
right) for this operation. A monoid homomorphism (morphism for short) is an 
application /i : M\ —>■ M 2 between two monoids verifying /i(xy) = /i(x)ii{y) for 
x, y G M\ and fx{l mJ = 1 m 2 with 1 Mi being the identity element of M* for i G {1, 2}. 
We say that a language L of E* is recognized by M if there exist PGM and 
H : E* — > M a monoid morphism such that L = yU _1 (P). Let ~ be a congruence 
on E*. We say that L is recognized by ~ if there exists P C E*/ ~ such that 
L = {u | [w] G P}. 
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Let M be a monoid and let /i : £* —* M. Then defined by u n <S=> /z('u) = 
/i(r?) is a congruence, and any language recognized by M is recognized by for some 
/i. Conversely, for a congruence ~ we have a monoid £*/ ~ with a binary operation 
induced by the natural morphism u K)■ [u] such that any language recognized by ~ 
is recognized by £*/ ~. Hence there is a natural equivalence between recognizability 
by congruence and by monoid. 


Example 2.1.1. Let £ = {a}. Let /i : 


be the morphism from 
The 


£* { 0 , 1 } 

a n i->- n mod 2 

£* into the group of two elements (written additively). Then L even = /x _1 (0). 
language L even is recognized by the group of two elements. 

The corresponding congruence is defined by a m ~ a n iff m = n mod 2. Then 
Leven = [e]. The language L even is recognized by the congruence ~. 


2.1.1 Syntactic congruences and monoids 

Let L be a language. We define the syntactic congruence of L as: 

u v <s=> (VT, y xuy e L xvy e L) 

From this we can naturally define the syntactic monoid of L, M(L ) = £*/ 
According to the Myhill-Nerode Theorem [ Ner58j . a language is regular if and only 
if its syntactic congruence has finite index. 

Remark 2.1.1. Let us also show that is the coarsest congruence recognizing L. 

Proof: We see that, by definition, L is a union of classes modulo hence the 
syntactic congruence of L does recognize L. Let ~ be a congruence recognizing L. 
Since ~ is a congruence, if u ~ v then for all x, y G £*, xuy ~ xvy. This means that 
if u ~ v, for all x,y G £*, xuy 6 1^ xvy G L. Hence u ~x v and ~ is finer than 


Example 2.1.2. Figure 2.1 (a) represents the multiplication table of the syntac¬ 
tic monoid of L ends , with elements labelled as representatives of congruence classes 
(omitting the identity element). 

Figure |2T (b) shows the automaton associated with the syntactic congruence of 
Lends, with the states labelled as representatives of congruence classes. 

The automaton associated with the syntactic congruence of L even is simply the 

The syntactic monoid of L e 


automaton of Figure 1.1 
elements. 


is the group with two 
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a 



ba | ba b b ba b 

(a) Multiplication table of the syn- (b) Automaton associated with the 
tactic monoid of L enc is- syntactic congruence of L enc i s . 

Figure 2.1 

2.1.2 Transition congruences and monoids 

Let A be an automaton, then we define the transition congruence of A as: 

u =a v yy (Vp, q G Q u G Lp^ q L Pjq ) 

From this we can naturally define the transition monoid of A, M(A ) = £*/ =a- 

Remark 2.1.2. Let A be an automaton. Then L(A) is recognizable by =a- Indeed, 
if u =a v and u G L(A) then v G L(A). Therefore L(A) is the union of congruence 
classes. This also implies that L(A) is recognizable by the monoid M(A). 

Let A be a left (resp. right) deterministic automaton, then we define the right 
(rep. left) congruence associated with A as: 

u v yy (Vp G Q u G L q0tP yy u G L qo . p ) 

Conversely, let ~ be a right (resp. left) congruence. We can define a unique de¬ 
terministic left (resp. right) automaton without final state associated with ~ as: 
A,^ = (£*/ ~, [e], <5) with 5 being the natural right (resp. left) action of £ on £7 ~. 

Remark 2.1.3. Let A be a left (resp. right) automaton. If we assume that A is trim, 
i.e. any state appears in at least one successful run of A, then u =a v if and only if 
for all w G £*, wu wv (resp. mu vw). 

Let us also remark that in that case, = A is the coarsest congruence to be finer 
than ~a- 
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For a transducer T, we will use the notations M(T), and = T when referring to 
the transition monoid of the underlying automaton of T. 

2.2 Monoid varieties 

Let U be a finite set of variables and si,s 2 £ U*. We say that a monoid M satisfies 
the equality Si = s 2 if f° r any morphism 77 : U* —> M , we have t/(si) = ?/(s 2 ). We say 
that a congruence ~ satisfies the equality Si = s 2 if for any morphism 7j : U* —» £*, 
we have 77 (si) V(S2)- 

A monoid pseudovariety (variety for short) is defined [Str96] as a set of finite 
monoids that is closed under finite direct product, the taking of submonoids (he. a 
subset stable by multiplication containing the same identity element) and of homo¬ 
morphic images. We will use the Theorem given in |ES76j to characterize monoid 
varieties. Let E be a countable set of equalities. Then we say that the set of fi¬ 
nite monoids satisfying all but finitely many equalities in E is the monoid variety 
ultimately defined by E. The Theorem states that any monoid variety is ultimately 
definable. In the following we will use this characterization as a definition. 

Let V be a monoid variety, we say that an automaton (resp. a transducer) is a 
V-automaton (resp. V-transducer ) if its transition monoid is in V. A language is 
called a V-language if there exists a V-automaton recognizing it. A congruence is 
called a V-congruence if the monoid associated with it is in V. 

Proposition 2.2.1. Let V be a monoid variety. 

• Let ~ be a congruence. Then ~ is a V-congruence if and only if it satisfies all 
but finitely many equalities defining V. 

• Let ~ 2 be two congruences such that ~ 1 C~ 2 . If ~i is a V-congruence, 
then so is 

Proof: Let U be a finite set of variables. Let M = £*/ ~ and let g : E* — > M be 
the natural morphism. Let us assume that ~ is a V-congruence. Let g : U* —> £* 
be a morphism. Since M e V, we have by definition that if M satisfies Si = s 2 then 
pog(si) = go ?/(s 2 ) which means that g(s 1 ) ~ 77 (s 2 ). Conversely, let us assume that 
~ satisfies all but finitely many equalities of V. Let 77 : U* —> M be a morphism. 
Let us show there exists a morphism v : U* —)• E* such that 77 = g o v. We only have 
to give the image of any u G U to describe a morphism. We define v{u) = w u with 
w u G g ^ ( 77 ( 77 )) (which is not empty since g is surjective). This yields a morphism 
which verifies 77 = go v. Hence we have that if ~ satisfies Si = s 2 then zz(s 1 ) ~ zz(s 2 ) 
which means that 7 ](si) = 77 (s 2 ). 
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Let Si,s 2 G U* such that ~i satisfies Si = s 2 , he. for any morphism r; : U* E*, 
V(si) rv_/ 77(52). In particular, 77 ( 51 ) ~ 2 ^(s 2 ), which means that ~ 2 satisfies si = s 2 . 
□ 


Remark 2.2.1. Some of the most commonly used varieties are shown in Figure [ 6 Tl] 
A particular case is the variety of aperiodic monoids, denoted by A, which has been 
shown to recognize the languages definable in first-order logic I Sell 65 . IMP71] . 

For a given variety V, the membership problem, i.e. deciding if some monoid 
belongs to V, is not decidable a priori. For a discussion on this problem see [Pin97j . 
However, if the membership problem is decidable, then the problem of deciding if a 
language is a V-language is also decidable. Indeed, according to Remark |2 .1.1 1 and 
Proposition |2.2.1 one only has to check that the syntactic monoid of the language 
belongs to V. 


2.3 Logics 

Given an alphabet E, monadic second-order formulas (MSO formulas) are built over 
first order variables x,y ,... and second order variables X,Y, .... Atomic formulas 
are built using the binary predicate < and for each cr G E a unary predicate, also 
denoted by cr. The formulas are defined by the following grammar: 

p ::= 3X ip | 3x p \ p A p \ -xp \ x G X \ a(x) | x < y | (p) 

Universal quantifiers and other boolean connectives are defined as usual: VX p = 
-BX -1 ip, \/x p = -iRe -xp, p\ V ip 2 = _l ( _l 7 , i A -xp 2 )i y ^2 = _l 7 , i V p 2 . We 
also define the constant formulas T and _!_ which are respectively always and never 
satisfied. 

We do not define the semantics of MSO formulas, neither the standard notion of 
free and bound variables, but rather give examples and refer the reader to [ EF95] 
for formal definitions. 

Given a closed formula p, and a word w satisfying p, we write w |= p and L(p) 
denotes the language of words satisfying p. 

A logical fragment of MSO (logic for short) T is a subset of formulas of MSO. 
Given a language L , we say that L is ^-definable (or an ^-language) is there exists 
a closed formula p G T such that L = L(p). 

Example 2.3.1. We define the successor relation as: 

su cc(x,y) — (x < y) A (Sz (x < z) A (z < y)) 


13 

















We define the minimum and maximum unary predicates: 


min (a;) = ->3 y (y < x ) 


max(x) = -i3 y (x < y) 

Then we define the MSO formula (p even which defines the language of words of even 
length: 


[Peven 


3X 3E Vx Vy 
A 
A 
A 


xelA —i(x G Y) 
min(x) Axel 
max(?/) Aj/Gh 

succ(x, y) —> (x G X A y G E) V (x G Y A y G X) 


We give below several logics and refer to Figure [671] to see the languages defined 
by these logics. 

First-order formulas (FO) denotes the logic of formulas without any second order 
variables. 


Example 2.3.2. Let £ = {a,&}. We define the formula Fends which defines the 
language L ends : 


Fends = 3x 3 y min(x) A a(x) A ma x(y) A a(y) 

MSO[=] denotes the logic of formulas without the binary predicate <. 

Let k be a non-negative integer, then F0 k denotes the logic of FO formulas with 
only k variables. 

Let V be a monoid variety and let J be a logic. We say that T is a logic 
corresponding to V if for any language L we have: L is an ^-'-language if and only if 
L is a V-language. 
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Chapter 3 

Logic-bimachine correspondence 

3.1 Bimachines 

Here we will describe the notion of a bimachine, introduced by Schiitzenberger in 
|Sch6l| . which is in some sense both left and right sequential (or subsequential). 
Bimachines have been shown, see e.g. [BB79] . to define exactly rational functions 
and are in that sense equivalent to functional NFTs. 

A bimachine is given by a tuple B = (L, R, co, A, p ) where: 

• L is the set of states of a deterministic left automaton given with an initial 
state Iq. 

• R is the set of states of a deterministic right automaton given with an initial 
state r 0 . 

• The output function cj : L x S x 1? 4 S* which may be partial. 

• The terminal left function p : L —)■ £* and the terminal right function A : R —> 
£* which may both be partial. 

The output function is extended to words in this fashion: Let u — G\... a n G 
let l —L 1 1 ■ ■ ■ l n _i -A l n be an execution of L on u from l and let r n -fT r ri _ 1 ■ • • rq ^ 2L 
r be an execution of R on u from r. Then, if it is defined: 

u(l,u,r) = r n _i) ■ • -uj(l n -i,a n ,r) 

If l — l 0 and r = r 0 , when it exists the image of u by B is defined as: 

{Bj(u) = \(r n )uj(lo,u,ro)p(l n ) 
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ft 



a, b 

a, b 

Figure 3.1: Left automaton of B. 



Run of L 
Input 
Run of R 
Output 


Figure 3.2: Execution of a bimachine 


Remark 3.1.1. In order to simplify, we will write L (resp. R) when referring both to 
the set of states and the automaton. 

Remark 3.1.2. When R is reduced to a single element then the bimachine is simply 
a subsequential transducer (sequential if p is a constant function equal to £ on its 
domain). 


Example 3.1.1. We describe a bimachine B = (L, R,oj, A, p) defining f en ds- The 
automata L = {lo,l a ,h} and R = {r 0 ,r a ,rb} are identical (except that one is a left 
automaton and the other is a right automaton) and L is shown in Figure 3.1 The 
functions A and p are constant and equal to e, oj(l,a,r) — a if l G {loja}, cr = a 
and r G {r 0 ,r a } or if l = l a , a = b and r = r a . In the other cases cu(Z,a, r) = e. In 
Figure [372] we show the execution of this bimachine on the word abaa. 


A bimachine is called complete if its output function is total. 


Proposition 3.1.1. Any bimachine is equivalent to some complete bimachine. 
Proof: The idea of the proof is to extend the output function by outputting £ 
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when it is not defined. To ensure that the domain of the function is preserved, we 
refine the left (for instance) automaton to only accept words in the domain. 

Let B = (L, R,uj, A, p) be a bimachine defining a function /. We construct 
B' = ( L',R,co ', A, p') a complete bimachine equivalent to B. Let A be an automaton 
recognizing dom(/), which is rational. We define L' = L x A. Let (l, q) be a state of 
L' and let r G R. Then u/((Z, q), a, r ) = oj( 1, a, r) if it is defined and uq),a,r) = e 
otherwise. If q is a final state of A then p'(l, q ) = p(l), otherwise p'(l, q) is not defined. 

If a word belongs to dom(/) then the outputs in the run of B’ are the same as 
the ones in B. If a word does not belong to dom(/) then it is not accepted by L'. 
Hence B and B’ are equivalent. □ 


3.1.1 From bimachines to transducers 

Let B = (L, R,oj, A, p) be a bimachine defining /, we can construct an NFT T = 
(Q, /, F, A, i, t ) defining the same function: 

• Q = L x R 

• I = {/ 0 } x dom(A). 

• F = dom(p) x {r 0 }. 

• A((Zi,r 1 ),(r, (l 2 ,r 2 )) = v 

if l\ l 2 , r 1 <— r 2 and cu(/i, a, r 2 ) = v. 

• i : ( lo,r ) A(r) whenever (lo,r) G /. 

• t : (Z, r 0 ) p(Z) whenever (Z,r 0 ) G F. 

3.1.2 From transducers to bimachines 

An NFA A = ( Q, /, F, A) is called unambiguous is for any word u = o\... a n G 
there exists at most one sequence qo —h qi ■ ■ ■ q n _ 1 -A with q$ E I and q n G F. An 
NFT is called unambiguous if its underlying automaton is unambiguous. It has been 
shown that any functional NFT is equivalent to some unambiguous NFT [ BB79 J. 

Let T = (Q, /, F, A, i, t) be an unambiguous NFT, we can naturally describe an 
equivalent bimachine B = (L, F, oj, A, p) with: 

• L — E*/ =t with the right action of £ as its transition function: [w] -G [ttcr] 
and Z 0 = [ £ ]- 
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• R — Tj*/ = t with the left action of S as its transition function: [au] G- [w] and 

r o = [e]- 

• u(l,a,r) is defined as the unique (by unambiguity) v such that 3ui G l,u r G r 
and p,q G Q with Ui G L I p , u r G L qF and A(p, a, q ) = v. 

• A((Zi, ri), a, ( l 2 , r 2 )) = v if Zi -G l 2 , rq G— r 2 and cu(/i, a, r 2 ) = u. 


[u] i—^ z(p) 


where p is the unique (by unambiguity) p G / such that 


^ [ti] i—f(p) 

u G Lj p . 


where p is the unique (by unambiguity) p £ F such that 


3.2 Logical transduction 

In the paper [FilTFj an equivalence is given between NFTs and so called order¬ 
preserving logical transducers. The following definition coincides with this class of 
order-preserving logical transductions, in the logics of MSO[<] and FO[<]. It is 
also somewhat similar to the notion of logical translation given by |MSTV06 ] in the 
particular case of length-preserving transductions. 

Let T be a logic. An ^-'-translation is given by a tuple 

T — l<j<k , {}Pv) v& s ’ { ( Pv) v£ s ) 

V >gS J 

T is composed of k > 0 and ScS* finite, the set of outputs. For 1 < j < k, a G £ 
and v G S, <pf jrT:V , Vv an d </4 are dosed ^-formulas. 

By definition (u, w) G [7~J if: 

for u = oq ... a n G £ n there exists a decomposition w = wqw± ... w n w n+ i G 
S n+2 such that: 

• for i G {1,..., n} there exists j, with 1 < j < k such that we have 
both: a x ... CTj_! |= pf^ Wi and a i+1 ... a n |= 

• u \= <pl 0 and u |= 

We want to ensure that the domain of the translation is an J-'-language. For this 
we add the following condition of exhaustiveness: 


18 






For any a G E, u, w G E*, there exists a pair { ( pf a - v i ( P'j av ) with j < k 
and v e S such that u \— <pf av and v |= Tpa,v 

Remark 3.2.1. An ^-translation T = lk,S, (<pf t(TjV , ( ff, a , v ) 1 <j<k veS ’ (vDveS ) 

\ aeE,v£S J 

might not define a function but a relation. In order to ensure that it is functional, 

we have to add some conditions: 

• Let j, f < k,a E H,v 1/^6 S. Either < pf t(TtVl A <pf/ aV2 is unsatisfiable or 
Vjw! A v},a. V2 is unsatisfiable. 

• Let Vi V‘ 2 G S, then ip 1 A ip l V2 is unsatisfiable. 

• Let Vy v- 2 . E S, then (p* A (p\ is unsatisfiable. 

In the following we will always assume that a translation satisfies these conditions. 
Example 3.2.1. We give a translation defining f en d s ■ Let E = {a,b} and T = 

( k ! S 1 (<Ptr,vi <Pjw) 1 <3<k r (<P[) veS > MveS ) with k = 2 and S = a }' 

V <t£E,v€S J 

• ^ga, e = 3a: min(x) A b(x) and c p^ £ = T. 

• via,e = T and ^ 0>e = 3a; max(a;) A b(x). 

• <Pi,aa = _| 3a; min(x) A b(x) and (pi aa = ->3a; max(o;) A b(x) 

• Vibe = “^a; min(x) A a(x) and (pf be = T. 

• T 2 be = 3~ and ¥2 be = max(a;) A a(x). 

• Tib a = 3a; min (a;) A a(x) and (pf ba = 3a; max(a;) A a(x ) 

• tpl — T and (p\ = T. 

• All the other formulas are set to A. 

3.2.1 From machines to logics 

Proposition 3.2.1. Let V be a monoid variety and let J 7 be a corresponding logic. 
Let f be a function definable by a complete V-bimachine. Then f is definable by an 
J 7 -translation. 
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Proof: Let B = (L, R,u>, A, p) be a complete V-bimachine. We can assume that 
the automata L and R are complete (for instance by taking the automata associated 
with and respectively). Let l G L , then there is an J-'-formula pf recognizing 
the language Li 0 j(L). Similarly for r G R let pf be an J-'-formula recognizing 


J ro,r 


(R). Let us define T = ik,S, {p< av , (pf ) i <j< k , (<pi) veS , (pt) v es with k 

\ a£Y,,v£S J 


\L\ ■ |i2|’, S being the set of outputs in B. For convenience, we will index the formulas 
with pairs of states instead of integer < \L\ • \R\. Let l G L, a G E, r G R, v G S 
such that tu(l,a,r) = v. We define < p‘n r ) av = pf and (p'n r ' )(TV = pf- We also define 
for v G S, p\ = f\ X ( r \ =v pf and p\ = /\ p ^ =v pf- We can see that since L and R are 
complete and to is total, the translation is indeed exhaustive. 

Let (u,w) G [5]. Let u = o\... a n G let Iq —b l\ ■ ■ ■ l n -i l n be the 
execution of L on u and let r n G— r n _i ■ ■ ■ r\ <— r 0 be the execution of R on u. Let 
w = v 0 vi... v n v n+ i such that cr,, r n _j) = Vi for 1 < i < n, X(r n ) = v 0 and 

and 




p(l n ) = v n+ \. Then for 1 < i < n we have both o\... &i-\ \= pf v i r 
a i+ i ...a n (= pfi.^r^.)^.- We also have u \= p l VQ and u \= p\ n+x - Hence (u,w) G 
[71. Conversely let us assume (u,w) G [T]. With the same notations as above, 
we have for 1 < i < n, a ,... \= pf u _ urn _ i)tlTitVi and a l+1 ...a n \= pf k _ urn _ i)ilTitV , 

We also have u |= p l VQ and u \= p\ ln+] ■ We just have to remark that the sequence 
of left (resp. right) states corresponds to an execution on L (resp. R). Then we 
have automatically that v t = a;(Z*_i, cq, r n _*), A(r n ) = vq and p(l n ) = fn+i- Thus we 
obtain (u,w) G [HJ. □ 


3.2.2 From logics to machines 

Proposition 3.2.2. Let V be a monoid variety and let J 7 be a corresponding logic. 
Let f be a function definable by an J 7 -translation. Then f is definable by a complete 
V-bimachine. 

Proof: Let T = ( k, S, (pf a v , pf^ v ) l <j<k , ( P* v ) veS , ( Pl) veS I be an ^-translation. 
V ct£E,v£S J 

We will describe a bimachine B = (L, R,co, A, p) equivalent to T. We define 
two sets of formulas: F L = {pf a v | 1 < j < k, a G E, v G S'} U {pi \ v G S} and 
F R = I 1 <J<k, a G E, v G S'} U {pi \ v G S}. For any formula in Fl there 

is a V-congruence recognizing the same language. Let be a V-congruence finer 
than all of these. Similarly we can define and we obtain naturally the automata 
L, R. Then for [u] G L, [w] G R and a G E, cu([xt], a, [w]) = v with v being the unique 
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word in S such that for some j < k, u \= ^pf_ a _ v an d w 1= '■f f.o.v • Let us remark that 
co is total by exhaustiveness. Hence we obtain a complete V-bimachine and exactly 
as above, we can show it defines /. □ 

Remark 3.2.2. If J 7 is a logic corresponding to a variety V, it can easily be seen that 
J-'-translations with the formulas if j a v = T are equivalent to V-DFTs. Moreover, if 
we set for v ^ e that 9 ?* = _L, we obtain the functions definable by V-DFTs with no 
terminal output. 
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Chapter 4 

Algebraic characterization of 
classes of subsequential functions 


The first part of this chapter deals with a minimization procedure described by 
[ Cho03j and we show it preserves the equalities of the transition congruence. In 
particular, one can decide if a subsequential function is definable by an aperiodic 
DFT. In the second part we show that a determinization algorithm of transducers 
preserves the aperiodicity of the transition monoid. 


4.1 Minimization 

In order to have an algebraic classification of functions, we need a notion similar 
to the one of minimal automaton. In the case of subsequential functions, there 
is such notion which we call the minimal DFT. We describe the construction of a 
minimal DFT given in [ Cho03] and prove that it preserves equality in the transition 
congruence. However we do not prove here that it is correct and refer the reader to 
the original paper. This minimization is canonical and not machine dependent. 

Let / be a subsequential function definable by some DFT T = (Q, q 0 , F, 

For u G £*, we define when it exists f(u ) = A {f{uw) \ uw G dom(/)}. We define 
the syntactic congruence of /, for u,v G £*, u v if: 

For all w G £*, uw G dom(/) <G> vw G dom(/) and when it is defined, f {u)~ l f {uw) 
J{v)~ l f{vw). 

Then the transducer Tf = (Qf,qof,Ff,6f,if,tf) is equivalent to T with: 

• Qf = IN I u G dom(/)| 
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• q 0 f — [e] (we assume that the domain of T is not empty). 

• 5([it],cr) = ([ua], f(u)~ l f(ua)) when finer) is defined. 

• Ff — {[w] | u G dom(/)}. 

• v([ £ D = /(e)- 

• A(M) = /(w) _1 /( M ) whenever [u] G Ff. 

Proposition 4.1.1. Given a monoid variety V, let f be a subsequential function. 
The function f is definable by a V-DFT if and only if its minimal transducer is a 

V-DFT. 


Proof: According to Proposition 2.2.1 we only have to show =t b = t Let / be 
a subsequential function definable by some DFT T = (Q, q 0 , F, 5,i,t). For q G Q let 
T q = (Q,q, F,5,e,t) and let f q be the corresponding function. Let mi,m 2 G E* two 
words such that m\ =t rn 2 . We construct Tf using the minimization in [Cho03j . Let 

v be a word in E*. Let go —— > P (if P does not exist, we can add a garbage state 


in T). Then obviously q 0 


vm 2 \x 2 


» p and for all w G E*, vm\W G dom(/) vm 2 w G 


dom (f). Then when it is defined: 

f(vrrii) = ixi A {f p {w) | w G dorn(/ p )} 


f(vm 2 ) = ix 2 /\ {f P {w) | w G dom (f p )} 

Let s = A { fp(w) | w G dom(/ p )}, then for any w G E* such that vm\W G dom(/): 
f (vmi )^ 1 f (vmiw) = s~ 1 xf 1 i~ 1 ix 1 f p (w) = s _ 1 /p(^) 

f{vm 2 )~ l fivm 2 w) = s~ l xf 1 i~ 1 ix 2 f p {w) = s _1 / p (w) 

Hence vm,i um 2 for any n G E*. We have in Tf for any d G S': [u] — b [rmj = 
[r:m 2 ] and [u] [um 2 ]. Hence mj = T m 2 and = Tf is the coarsest amongst the 

transition congruences of all the DFTs defining /. □ 


Theorem 4.1.1. Let V be a monoid variety. Let us assume that the membership 
problem is decidable for V. Then there is an algorithm to decide if a function defined 
by a DFT is definable by a V-DFT. 
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Figure 4.1: Minimal DFT of g. 


Proof: This follows easily from Proposition |4.1.1| We only have to construct the 
minimal DFT and check if its transition monoid belongs to V. □ 


Example 4.1.1. Let us consider the sequential function g which is equal to f en d s , 
restricted to L ends . In other words, g en ds is defined on words that start and end with 
an a and replaces each letter of the input by an a. We have: 

• ?(e) = a. 

• For w G a + aE*a, g(w) = a ) w L 

• For w G aT,*b , g{w) = a^ +1 . 

• For w G bT,*, the function is not defined. 

Hence we obtain the following congruence classes: [e:], [a] = [a + a£*], [ab] = [aT,*b] 
and [6] = [6E*]. From this we obtain the minimal DFT of g given in Figure [4~Tj with 
states labelled as representatives of congruence classes. 

4.2 Determinization of aperiodic transducers 

It has been shown that it is decidable whether an NFT is detcrminizable or not. 

In this case we will describe the determinization algorithm given in [BC02 j and 
show it preserves the aperiodicity of the transducer. In particular, we have that a 
subsequential function is definable by an aperiodic DFT if and only if it is definable 
by an aperiodic NFT, which does not hold for a general variety as can be seen in 
Section 16.21 

Let T = (Q, /, F, A, i, t ) be an NFT. We give a construction of V = ( QS 0 , F', 5, i', t') 
equivalent to T. Since we are only interested in the transitions of T', we will not 
describe the final states nor the terminal output function. 
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In the obtained transducer T', a state S' is a set of pairs (p,w) G Q x E*. Let 
X C E*, X ^ 0 then we define AX to be the longest common prefix of all the 
words in A". Let i! = A {i(q) \ q £ /}. Let S'o = {( q,w ) | i(q) = i'w}. Then for 

Si a state and a G X, we define i? 2 = | (q,vu) \ (p,v) G Si and p — gj. Let 
s = A{w|(?,w)G 1? 2 }. Then we define S' 2 = {(g,u;) | (q,sw) G i? 2 } and there is a 

transition Si S' 2 in T'. Let us remark that, by construction, in the underlying 
automaton of T', Si —> S' 2 implies that the states of S' 2 are exactly the states reachable 
by reading u from a state of Si. 


Proposition 4.2.1. Given a subsequential function defined by an A-NFT, the trans¬ 
ducer obtained by determinization is an A-DFT. 


Proof: First let us denote by u -< v that u is a suffix of v. And let u ~ u if u -< v 
or v -< u. 

Let / be a rational function defined by an A-NFT T = (Q, /, F, A, i, t). There exists 
an integer n such that in the underlying automaton of T, for all p, q G Q, we have 


„n+l 


p —» q if and only if p -> q. 

An automaton is counter-free if for any state q, any word u, any positive integer 


k, q —>■ q =>■ q A q. Since T' is deterministic, we only have to show that it is 
counter-free (see e.g. [ DG08] ). 

Let u be a word, let Rq be a state of T' and let k be, if it exists, the smallest 


Hjk n|tCO 

positive integer such that Rq —>■ Rq. Let Rq -> Ri ..., Rk-i —-A Rq be the 

corresponding sequence of transitions in T'. We have to show that k — 1. Let us 
remark that all the states of the Rfi s must be the same since they are the set of states 
reachable, in T, by u n from the states of R 0 . Let Rj = {{q { , ..., (q mi r mj )} be 

pairwise distinct for j G {0, ..., k — 1}. We assume that not all xfis are empty words, 
otherwise, the conclusion is immediate. 


u \ x il,i2 

Let q t] - —> qi 2 denote the transitions in T. 

have for any for j G {0,..., k — 1}: 


Then by construction of X", we 


Al .j^i\ .12 ^j-\-Rh 2 ,j+1 

Let *o, ii,... fit be an index sequence such that qi t A q ll+l for l G {0,..., t — 1} and 
t a multiple of k. Then we obtain for any j, f G { 0 ,..., k — 1 }: 

v io,j x ia,h ‘ ‘ " x j +1 ‘ " ‘ x j+t v it,j 

Aq ,j' x iQ,ii ' ' ' x it—i,it X j'-\~ 1 ’ ’ ’ x j'+t x it,j' 
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In particular the two words have a common suffix which means that v ittJ and v luJ i 
have a common suffix. This suffix can be arbitrarily large since t can be chosen 
arbitrarily large. Hence v l ^ cs v t ^ for any (any state q t is reachable by a 

sufficiently long sequence). 

Let X 0 = x 0 ■ ■ ■ Xk- 1- For j e { 1 ,..., k — 1 } and let Xj = Xj ■ ■ ■ , Xj _i denote 
the corresponding rotations of X 0 . Let us remark that XjXj = XjX 3 + i. Since T' is 

yt 

finite, there exists an index i such that we have both: q.j —>• qj and q t - > q t , and 

let i — i\, i2, ■ ■ ■, it, i and i = i\ , i ' 2 ,..., i be the corresponding index sequences. 
By aperiodicity, we can choose t = sk, a multiple of k. Let Y = x ilji2 • • ■ and 
let Y' = Xj' ,-/••• Xj' j' . Then we have for any ?': 


VjjY = (. Xj) s Vij 

^i,j+ lA (Xj+ 1) Vi,j+1 

VjjY = (Xj) XjVjj+i 


Let us assume that Vjj -< v lJ+ \ which 

means that 

^i,j+ 1 

v^Y = 

( X j) Sy i,j 

(«) 

wVjjY = 

(X j+1 ) s wVij 

W) 

VijY' = 

(XjYxjWVij 

(7) 


From (a) and (f3), we have: w(Xj) s = (X 3 + \) s w. And by multiplying by Xj and 
using XjXj = XjXj + 1 we have: Xjiv(Xj) s = (Xj) s XjW. 

From (7), by multiplying by Y' on the right k times we have: Vij(Y') k = 
(( Xj) s Xjw) k Vjj. Moreover, since ( Y') k corresponds to a sequence of length a multiple 
of k , we also have: Vjj(Y') k = (Xj) ks+ 1 v i: j. Hence (Xj ) ks+1 = (( Xj) s xjw) k . And by 
combining the two we obtain (Xj ) ks+1 = (Xj) ks (xjw) k , hence: 


Ah, 


(XjW) 


k 


This yields 

(X j+ i) s w = w(Xj) s 
= w(x 3 w) ks 

= ( WXj) ks W 

Hence 


Xj+i = (w Xj ) k 


Let qi be another state such that qi —t qj. If Vij -< Vij + 1 then U/j+i = WiVij. 
However, it also implies that (xjw) k = ( x 3 wi) k , hence wi = w. On the other hand, 
let us assume Vij >~ Uy+i with WiV^j + 1 = Vij. Let T) and Y{ correspond this time to 

sequences associated with qi q L and qi ——* qi, respectively. 


26 



wiVij+iYi = (. Xj) s wiVi J+ i 

vij+iYi = (X j+ i) s vi J+ i 
vi, j+ \Y{ = (. X j+1 ) s x j+ i ■ ■ ■ Xj-iWiVij+i 

As above, the first two equations give: ( Xj) s wi = wi[X J+ \) s 

From the third equation we have: ((Xj + i) s x j - + i • • • Xj-\Wi) k = (X, + i) fcs+fc ~ 1 . Let 
us remark that (Xj + i) s Xj+i • • • Xj-i = x J+ \ ■ ■ ■ Xj-\(Xj) s , hence • ■ • Xj-iWi 

Xj + 1 • • • Xj_iWi(Xj + i ) s . From this we obtain (xj +1 ■ ■ ■ Xj^\w{) k = (X J+1 ) fc_1 . 

Now from above we can write: 


{Xj +1 • • -Xj-iWi) k 

= (( WXj ) k ) k-1 

%j -\-1 * * ’ %j—l r Wl 

= \wXjY ~ 1 

WXjXj+i • • • Xj-iWi 

= ( WXj) k 

wXjWi 

= X j+ 1 


Hence by length, we obtain w = Wi = e. 

Let qi be a state reachable from ^ (any state is reachable from such a looping 
state). Then it can be reached by a sequence of length t and there is a correspond 
sequence of outputs denoted by Z\ such that: 


VijZi = (Xj) s vij 

wv^Zi = (. X j+1 ) s v iJ+ i 


Hence wvij = vij + 1 . 

Finally, w is a common prefix to all Vi>j + 1 (j fixed and i' £ {1, • • • , m}) and must 
by definition be empty. The reasoning is the same if we assume y v it j + \. Hence 
Vi'j = Vi'j + i and therefore k — 1, which means that T' is counter-free and thus is 
aperiodic. □ 


Remark 4.2.1. In Section |6.2 we give several examples of varieties V for which, 
contrary to the variety of aperiodic monoids, determinization of a transducer does 
not preserve the variety of the transition monoid. In fact we even give examples of 
subsequential functions which are not definable by any V-DFT yet are definable by 
a V-NFT. 


Example 4.2.1. Let g be defined by the aperiodic NFT of Figure |4~2| (a). Then the 
DFT obtained by determinization is the one of Figure 4.2| (b). 
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(a) NFT defining g. (b) Determinized transducer. 


Figure 4.2: Determinization of a transducer 


28 




Chapter 5 


Algebraic characterization of 
classes of rational functions 

In this chapter we give a characterization of functions that can be defined by a V- 
NFT for a given variety V. The goal is ultimately to decide if a function can be 
defined by a logical translation for a logic corresponding to the variety. The notion 
of minimal automaton does not transfer to NFTs. However there is some notion of 
canonical bimachine, introduced in |RS91j . which is the foundation of the algebraic 
characterization we give. 

5.1 V-transducer and V-bimachines 

Since the algebraic characterization deals with bimachines, we need to define a notion 
of bimachine equivalent to V-transducers. A bimachine is called a V-bimachine if 
both its automata are V-automata. We show the following relations between V- 
transducers and V-bimachines. 

Proposition 5.1.1. Let V be a monoid variety. Let f be a rational function. Then 
there is a equivalence between the three following properties: 

1. The function f is definable by an unambiguous V-transducer. 

2. The function f is definable by a complete V-bimachine. 

3. The function f is definable by a V-bimachine and dom(/) is a V-language. 
Furthermore there is a construction from each one to the others. 
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Proof: 1. =>- 3. follows from the construction given in Section 3.1.2 Since = T is 


a V congruence, we have that L and R are both V-automata. Moreover, dom(/) is 
a V-language since it is accepted by the underlying automaton of a V-NFT. 

We can choose 


3. 2. follows from the construction given in Proposition 3.1.1 


the automaton A to be a V-automaton since the domain is a V-language. Hence we 
obtain a complete V-bimachine. 

Since the bimachine 


2. 1. follows from the construction given in Section 3.1.1 


is complete, the underlying automaton of the obtained transducer is just the prod¬ 
uct L x R, with R being the left automaton obtained by reversing the transitions 
in R. We have that =^, and =r are equal. Therefore the transducer we obtain is a 
V-transducer. □ 


Remark 5.1.1. For some variety V, there exist functions definable by a V-bimachinc 
for which the domain is not a V-language. An example is given in Section [673 

A question that remains is the following one: If a function is definable by a V- 
NFT, is it definable by some unambiguous V-NFT ? In the next part, we will see 
that the answer is yes in the particular case of aperiodic transducers. 


5.2 Canonical bimachine 

In this part we give the construction, for a given rational function, of a canonical 
bimachine that is described in [ RS91 J and several properties of this bimachine. It 
is canonical in the sense that it is not machine dependant and only depends on the 
function itself. Although, we give the construction of the bimachine, we refer the 
reader to the original paper for a proof of correctness. 

In order to define the bimachine we define a left congruence from which we define 
the right automaton. Then from this we define the right congruence which yields the 
left automaton. 

5.2.1 Left congruence 

Let / be a rational function on E. 

Let the left distance between two words u, v be ||w, u|| = |w| + |w| — 2\u A v\ where 
u A u is the longest common prefix of u and v. 

We define the left syntactic congruence of / (which is a left congruence) by 


30 










u ~ Ro v if: 

Vu> G E*, wu G dom(/) wv G dom(/) 
and 

sffiWs* {\\f( wu )J( wv )\\} < 00 

Let be a left congruence finer than ~^ 0 . Let R = E*/ ~r, Ro = E*/ ~r 0 
and let r$ = [e] be the initial element of R. 

For r, r' G A, cr G E, if r' e- r then for convenience we denote by r' = ar the 
action of E on R. 

For this fixed R we construct a bimachine B R = ( L R , R*, u R , X R , p fi ) which defines 
/ according to |RS9lj . By taking R 0 as a right automaton we get a completely 
canonical bimachine B° = (L°, Ro, cu 0 , A 0 , p°). Since we are only interested in the 
automata of the bimachine, we will not give the descriptions of u R , X R and p R , and 
once again we refer the reader to |RS91j . 

Remark 5.2.1. The construction is completely symmetrical: we can define similarly 
a right congruence: u v if the supremum of the right distance between f(uw) 
and f{yw ) over all words w is less than infinity. Then for any right congruence 
finer than ~£ 0 , we can define B L = (L, R L , uj l , X L , p L ) defining /. We denote R L ° 
by R° to simplify. 

First we show that the automaton Rq (as well as Lq) has some minimal prop¬ 
erty, i.e. its transition congruence is coarser than the transition congruence of any 
transducer defining /. 

Proposition 5.2.1. Let f be a rational function defined by some V-NFT. Then the 
automaton R 0 = (E*/ ~_r 0 , [s],5) ; where S is the natural left action on E*/ ~h 0 ; a 
V-automaton. 


Proof: Let T = (Q, /, F, A, i, t) be a V-NFT defining /. According to Proposi¬ 
tion 2.2.1, it suffices to prove E =r 0 - Let w G E* be two words, let mi = T m 2 

and let q 0 —p mi —> ■ qf be an accepting run of T on wm\. Then there is an accept- 


qjj 7712 | 

ing run q 0 —-> p- y qf. Thus the distance \\f(wmi), /(rcm 2 )||< k(\mi\ + |m 2 | + 1) 

with k being the maximum length of an output in T. Hence m 1 m 2 . □ 


5.2.2 Right congruence 

Now that we have defined the right automaton of the bimachine, we need to define 
the left one. In order to do that we define this time a right congruence. 
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Let r G R, we define the function 


fr{u) = A {/M I ur o = r} 


These functions are similar to the prefix function defined for minimizing subse- 
quential functions in Section 4.1 However this time we need a finite family of prefix 
functions. It is proven in [RS91] that f ar ( u ) is a prefix of f r {ua) for a G G £* 
and r G R. Then for u,v G E* we say w v if for any ff 6 S,w G S* and r G P 
we have when it is defined both: 


farfl 

fro {UW 


uw) 1 f r (uwa ) 

n7(mu) 


far (vw) 1 f r (vwa) 

/r 0 (™)'7H) 


This right congruence has finite index, according to |RS91j . thus we obtain the 
left automaton of the bimachine B R defining /. 

For the purpose of this report, we want to study the algebraic properties of L R 
and for this we need to show that the functions f r , r G R maintain some algebraic 


properties of T (given below in Proposition 5.2.2 and Corollary 5.2.1) and for this 


we give the construction of a family of transducers defining them. Let us give a 
construction of a transducer defining f r , for r G R. According to |RS91] there exists 
L r = {t’i,... v m } finite such that f r (u ) = /\ { f(uv) \ v G L r }. 

Let T x R = (Q x R, I x R, F x {r 0 } , A', if t') be the transducer that does exactly 
as T does except that it guesses in a non-deterministic fashion the congruence class 
of the word read, and checks by taking the transitions of R backwards to reach 
tq. In particular this transducer defines the function /, the states are just refined to 
also give the class in R of the suffix left to read. 

Now from this transducer we want to build T r = (Qr, Ir, F r , A R , i R , t r ) defining 
f r with ir = A {i(q) | q G /}. The construction will be similar to the determinization 
algorithm, except that only states with same class in R will be merged. The algorithm 
will end since for Wi^ R w 2 we have by definition the left distance between f(wwi) 
and f{ww 2 ) that is bounded. 

Each state of T r is composed of a set of pairs (q, w) G Q x £* and a class sGk 
For ir = A{i(q) \ q G 1} and s G R, each ({(q,w) \ i(q) = i R w} , s) belongs to Ir. 
Let (P, s ) be a state of Qr, let a G E and s' G £ such that s = as'. Then we can 
determine a set 

3 (q, w) G P and q —if 


S = 


, wu) 


Let v be the longest common prefix of the words of S then we define 


P' = {(qfw')\(qfvw')eS} 
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There is then a transition (P, s) — -y (P', s'). We define F r = {(P, s) G Qr | s — r}. 
Let (P, r) G P r with P = {(gi, wfi ),..., (q m , w m )}. For any v G L r there exists j v such 

that for some qj v G F, qj v — qf v . Then we define t r ((P,r )) = /\ | v G L r }. 

By construction, T r defines f r , furthermore this construction has the advantage 
of yielding an unambiguous transducer: for a given word, there is only one sequence 
of states in R corresponding to it and since all states in Q x R following this sequence 
are merged, there is only one possible accepting run. 

Let Tr = (Qr, Ir, A r) denote the (unique) underlying automaton associated with 
the T r ’s (when not taking final states into account). 

For the following, we also need to show that / is definable by a NFT with Tr as its 
underlying automaton. We define Tf = (Qr, Ir, P/, A R ,i R ,tf) with the final states 
Ff {( P, r 0 ) | 3 (p,w) £ P and p G P} and the terminal function T/(P, r 0 ) = wt(p) for 
(p, w) G P and p G F. 

The construction of Tr allows us to claim the following property and its corollary. 


Proposition 5.2.2. Let f be a rational function defined by some V-NFT. If the 
automaton Tr is a V-automaton, then so is the automaton L R = (£*/ [e:],5), 

where 5 is the natural right action on £*/ ~ L . 


Proof: Let T = (Q, I, F, A,i,t) be a V-NFT defining /. According to Propo¬ 
sition 2.2.1 it suffices to show = t r □ ~l-r. Let mi =t r m 2 , r G R and w G £*. 

t 1 / 7—. x mi\ui , x w\v , x <t\v' , x , / „ x m 2 |«2 

Let (Po,miwar) -> (Pi, war) —y (P 2 ,ar) — > (P3,r) and ( Po,miwar ) - y 


(Pi, war) ——y (P 2 ,ar) AA. (P 3 ,r) be successful runs of T r (if they exist). Then we 
have: ^ 

f r (miwa ) = uivv't r (P 5 ,r) 
f r (;m 2 wa) = u 2 vv't r (P 3 ,r) 
fafimiw) = Uivt ar (P 2 , ar) 
fffr(m 2 w) = u 2 vt ar (P 2 , ar) 

By definition t ar (P 2 ,ar) is indeed a prefix of v't r (P 3 ,r), hence /^rCmi ) -1 f r (miw) = 
fwr(m 2 )~ l f r (m 2 w). 


r \v' 


We have mi G dom(/) if and only if m 2 G dom(/). Let (Po,miu;ro) 


mi | wi 


(Pi,tcr 0 ) — (P 2 ,r) and (P 0 ,mitcr 0 ) 2 -> (Pi,wr 0 ) —^—y (P 2 ,r) be successful runs 
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a, b 



a,b a,b 


Figure 5.1: Automaton T R . 

of 7) (if they exist). Then we have: 

f(miw ) = 

f{m 2 w) = u 2 vt f (P 2 ,r 0 ) 
fr 0 (miw) = uivt ro (P 2 ,r 0 ) 
fr 0 (m 2 w ) = u 2 vt ro (P 2 , r o) 

By definition t ro (P 2 ,r 0 ) is indeed a prefix of tf(P 2 , r 0 ) since £ 6 r 0 , hence /r 0 (mi) _1 /(mi) 
/r 0 (^ 2 ) _1 /(^ 2 ), and mi m 2 . □ 


Corollary 5.2.1. Let V be a monoid variety stable by determinization of transduc¬ 
ers. Let f be definable by a V-transducer. Then B° is a V-bimachine. 


Proof: According to Proposition |5.2.1 R 0 is a V-automaton. Hence by construc¬ 
tion T Ro is a V-automaton since V is stable by determinization. Hence, according 
to Proposition 5.2.2, L° is also a V-automaton. □ 


Example 5.2.1. We consider the function f en d s and one can show that the automa¬ 
ton Rq is equal to the one of Example |3.1.1 i.e. u ~r 0 v if they have the same last 


letter. The automaton T R we obtain for the NFT defining f en ds is given in Figure |5T| 
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5.2.3 Aperiodic case 


In Chapter[4]we have proven that aperiodic transducers are stable by determinization. 
Hence we can decide if a given function is definable by an aperiodic transducer, and 
claim the following Theorem: 

Theorem 5.2.1. There is an algorithm to decide if a function defined by a transducer 
is definable by an FO-translation. 


Proof: Let us assume / is definable by an A-NFT. Then its domain is an 
A-language and, according to Corollary 5.2. 1[ the canonical bimachine is an A- 
bimachine. Let us assume that / is definable by an FO-translation. According 
to Proposition 3.2.2, we can construct a complete aperiodic bimachine defining /. 
Then according to Proposition 5.1.1 we can construct an A-NFT defining /. Hence 
to decide if a rational function given by a NFT is definable by a A-NFT, we need 
to: 


1 . 

2 . 

3. 


Check that the domain is an aperiodic language. 
Construct the canonical bimachine B°. 


If the domain is an aperiodic language and B° is aperiodic, from Proposi- 
tion|5.1.l1we can build a complete aperiodic bimachine B defining the function. 


4. From B, according to Proposition 3.2.1[ we can construct an FO-translation 
defining the function. 


□ 


Remark 5.2.2. In particular, a function is definable by an aperiodic transducer if 
and only if it is definable by an unambiguous transducer, since the construction from 
bimachine to transducer always yields an unambiguous transducer. 


5.3 General variety 

In this section we try to generalize the previous result to any variety. We show that 
if a function is definable by an unambiguous V-NFT, then there exists a bimachine 
amongst a finite set of ’’canonical” bimachines that is a V-bimachine. We first show 
the result on total functions and then extend it to the general case. 

Let A, A' be two deterministic left (resp. right) automata. In order to simplify 
the notations, we will write A\T A 1 and say that the automaton A is finer than A 1 
whenever C ~ A ,. 
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5.3.1 Total functions 


The construction is a finer version of the previous one and heavily uses the following 
Theorem of [ R.S91 J. 

Theorem 5.3.1. Let f be a total rational function, let R be a right automaton finer 
than Rq and let L R be the left automaton of the bimachine B R . Then we say that 
L r is minimal for R in the following sense: for any bimachine defining f with R as 
its right automaton and L' as its left automaton, we have 

L' □ L R 

We also use the following Proposition which holds for total functions only. 

Proposition 5.3.1. Let B = (L, R,u, A, p) be a bimachine defining some total func¬ 
tion f. Then we have both L C L 0 and R □ Rq. 

Proof: Let us show R C R 0 since the proof is the same for L 0 . Let u,v G S* 
such that u v. ft means that ro A r and ro A r for some r € R. Let 
w = <j\ ... C7 n G £ n and let Iq -A l\... l n -1 -A- l n be the execution of w on L. 
Similarly, let r n A- r n _i... r\ A- r be the execution of w on R from r. Then 

\\f{wu), f(wv) || = || \(r n )u(l 0 , w, r)u(l n , u, r 0 )p(l), X(r n )u(l 0 , w, r)u(l n , v, r 0 )p(Z , )|| 
= \\uj(l n , u, r 0 )p(l),uj(l n , v, r Q )p(V) || 

< k(\u\ + |n| + 1) 

with k being the maximum size of an output in B. Hence u ~i? 0 v. □ 

In the following Proposition, we show that L° is maximal amongst the set of 
minimal left automata, but before we need the Lemma: 

Lemma 5.3.1. Let B be a bimachine with L as its left automaton and R as its right 
automaton. Let L' be finer than L and R! be finer than R. Then there is a bimachine 
B' with L' and R' as its automata and equivalent to B. 

Proof: Since LI and R! are finer than L and R , respectively, the bimachine B' 
can behave as B by disregarding the finer information on its states. □ 


Proposition 5.3.2. Let f be a total rational function. Let R be a right automaton 
finer than R 0 . Let L R be the left automaton of the bimachine B R . Then 

L° C L r 
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Proof: This follows from Theorem 5.3.1 Since R □ R 0 , there is, according to 


Lemma 5.3.1, a bimachine defining f with L° and R as its automata and we have 
L° T L r . □ 


For a complete function / we have proven that L° is maximal amongst the set 
of minimal left automata. This means that if we want to show that / is definable 
by some bimachine with a V-automaton as its left automaton, we only have to 
check if the finite set of automaton coarser than L° (and finer than Lq) contains a 
V-automaton. 

If there exists a V-automaton coarser than L° and finer than L 0 , then we define 
Lv as the finest left V-automaton such that L° IT Lv T L 0 . Finally we define 
R v = R Lv as the right automaton of the bimachine B Lw . 

Before moving on, let us show the following lemma: 

Lemma 5.3.2. Let ~i IT ~ 2 be two right (resp. left) congruences. Let = 1; = 2 denote 
the transition congruences of the left (resp. right) automata associated with ~i and 
~ 2 , respectively. Then =i is finer than = 2 . 

Proof: Let us assume that ~i, ~ 2 are left congruences, the proof being completely 
symmetrical. We have that u =i v if and only if for all w G £*, wu wv. Hence 
we can easily see that u =i v u = 2 v. □ 


We can now claim the desired property: 

Proposition 5.3.3. Let V he a monoid variety. Let f be a total rational function. 
The function f is definable by an unambiguous V-NFT if and only if R v is defined 
and is a V-automaton. 


Proof: Let / be a total function, let us assume that there exists a V-NFT defining 
/. According to Proposition 5.1.1 there exist a V-bimachine B = (L, R, u>, A, p) defin¬ 
ing /. Let L R be the left automaton of the bimachine B R with R as its right automa¬ 
ton. Then, according to Theorem 5.3.1 , we have L T L R 
L r is a V-automaton. We have L° IT L v T L R T Lq 


hence from Lemma 5.3.2 
Let R' = R( lR ^ be the right 
automaton of the bimachine B^l with L R as its left automaton. We have, again 

R T R', hence R' is also a V-automaton. We have 
Hence according to Lemma 5.3.2, 7? v is also a V-automaton. 


according to Theorem 5.3.1 
R 0 HR' C A v T Rq. 


Conversely, we have in particular H Lv a V-bimachine dehning the total function 
/ hence, according to Proposition |5. 1.1, we can obtain a V-NFT dehning /. □ 
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Remark 5.3.1. In particular, total functions are completely characterized by a finite 
set of canonical bimachines in the following sense: Let / be a complete function 
defined by a bimachine B with L and R as its left and right automata. Then there 
exists a bimachine B min defining / with L min and R m in as its left and right automata 
with L Z L min and R Z R m in such that: L° Z L min Z L 0 and R° Z R min Z R 0 . 


5.3.2 Partial functions 


Now that we have the result for total function, we need to extend it to partial 
function. Let / : £* —> £* be a (partial) rational function. We define the completion 
of /: 

/: £* ->■ (SW{1})* 

f(u) if u G dom(/) 

_!_ otherwise 


u 


!->• 


Lemma 5.3.3. Let V be a monoid variety. Let f be a rational function such that 
dom(/) is a V-language. The function f is definable by an unambiguous V-NFT if 
and only if f is definable by an unambiguous V-NFT. 

Proof: Let us assume / is definable by an unambiguous V-NFT T. Then there 
is a V-automaton recognizing the complement of dom(/). We can easily see that 
the complement of a V-language is a V-language: If M is a monoid in V such that 
dom(/) = /i _1 (P) for some morphism /i and P C M, then X*\dom(/) = p _1 (M\P). 
By adding to T a deterministic transducer that maps any word not in the domain 
of / to _L, we obtain an unambiguous V-NFT defining /. 

Conversely, let us assume / is definable by a V-NFT. Since / is total, according 


to Proposition 5.1.1 / is definable by a V-bimachine. Then by restricting the output 
function to values not equal to _L, we obtain a V-bimachine defining /. According 
to Proposition 5.1.1 again, / is definable by a V-NFT. □ 


Remark 5.3.2. We have reduced the problem of deciding whether a function is de¬ 
finable by an unambiguous V-NFT to the membership problem of V. 


Theorem 5.3.2. Let V be a monoid variety. Let us assume that the membership 
problem is decidable for V. Then there is an algorithm to decide if a function defined 
by an NFT is definable by an unambiguous V-NFT. 


Proof. According to Proposition 5.1.1 and Lemma 5.3. 3[ / is definable by an unam¬ 
biguous V-NFT iff / is definable by a V-NFT and the domain of / is a V-language 
iff / is definable by a V-bimachine and the domain of / is a V-language. 
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The construction goes as follows: First we check that the domain of / is a V- 
language. We construct a NFT defining /. Then we construct the bimachine B Lw , if 
it exists, and check if R v is a V-automaton. If it is the case we use the construction 
of Lemma 5.3.3 to obtain a V-bimachine defining /. Finally, from the construction 
of Proposition |5.1.1] we obtain an unambiguous V-NFT defining / 


□ 
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Chapter 6 

Remarks, examples and 
counter-examples 


6.1 Monoid varieties 


The monoid varieties we consider most often are given in Figure 6.1, which is taken 


from |Darl5j |. We also give the corresponding logical and language characterizations. 
The equalities are given with the symbol oj which can be seen as the idempotent 
power of a finite monoid M, i. e. the smallest integer n such that any element to the 
n is idempotent. These equalities, called pro finite equalities, also characterize the 
monoid varieties. For a proper definition we refer the reader to |Pin97j . 

Remark 6.1.1. Since one can determine the idempotent power of a finite monoid, we 
can see that the membership problem is decidable for varieties defined by a finite 


set of profinite equalities. This is true for the varieties listed in Figure 6T and in 
particular for the variety of aperiodic monoids. 


6.2 Determinization 


Here we see that, in general, determinization of transducers does not preserve the 
equalities of the transition monoid. 

Example 6.2.1. Let us consider I the variety of idempotent monoids, i.e. satisfying 
x = x 2 . Let E = {a, b}. We define the function: 


f:a n ^ 


a if n — 1 
b if n > 1 
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Variety 

Equations 

Logic 

Languages 

Finite Monoids 

- 

MSO 

Rational 

Commutative Com 

xy = yx 

MSO[=] 

Commutative 

Aperiodic A 


FO 

Star-free 

D-Aperiodic DA 

(xyz) uj y(xyz) ul = {;xyz) u 

FO 2 = A 2 

Unambiguous 

Ji 

x = x 2 , xy = yx 

FO 1 

- 

^-trivial J 

y{;xyY = (xy) u = ( xy) u x 

BE 1 

Piecewise 

testable 


Figure 6.1: Monoid varieties 



Figure 6.2: I-NFT. 


The NFT of Figure (L2 defines / and verifies x = x 2 . 

Let us assume that T' is an I-DFT defining /. Then we can decompose /(a) = ut 


and /(a 2 ) = uvt , with q 0 q q and t is the terminal output corresponding to 
q. Then u — t — f(a) = £, which yields the contradiction. 


Example 6.2.2. There is a sequential function definable by a Com-NFT that is 
not definable by any Com-DFT. Let E = {a,b,c}. We define the function /: 


ab (->■ a 
aab i —y a 
aba i—>• ab 
ia 4 c 
baa t-)- c 


The NFT of Figure 6.3 defines / and verifies xy — yx. 

Let us assume that V is a DFT defining / and verifying xy = yx. 
can decompose f(ab) = u\t\J{ba) = ^i, f(aba) = uivt 2 and f(baa) = 


Then we 
u 2 vt 2 with 
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ab\u± a\v ba\u 2 . 

qo - > q\ —> g 2 , qo - > q 1 and H, £2 are the terminal outputs corresponding to 

qi and < 72 , respectively. Since f(aba) and f{baa ) have no letter in common, we have 
v = t 2 = £■ Hence iq = ab, and we have a contradiction. 


Example 6.2.3. There is a sequential function definable by a J-NFT that is not 


definable by any J-DFT. Let E = {a, a 0 , ai,b,b 0 ,bi,c, d}. The J-NFT of Figure 6.4 
defines a subsequential function /. 

Let us assume that T' is a J-DFT defining /. Then we can decompose /((a&) a; c) = 


ut c , f((ab) u ac ) = uat c and f((ab) u ad) = uatd with q 0 


(afy^lu 


> q 


a\a 


q. Since 


/((a6)‘ J c) and /((a6)‘ J ac) have no common suffix, we have t c = e. Hence we obtain 
that f((ab) w ac) is a prefix of /((ab^ad) which gives the contradiction. 
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Figure 6.5: Left automaton of the bimachine B. 



6.3 V-domain 


Let E = {a, b}. We give an example of a Com-bimachinc B = (L, R,oj, \, p) such 
that its domain is not a commutative language. The right automaton is the complete 
automaton with one state. The left automaton is given in Figure 6.5| The functions 
A and p are constant and equal to e. We define u>(l 0 ,a,r 0 ) = u(li,b,r 0 ) = e and u> 
is not defined otherwise. Then the domain of the function is equal to {ab} which is 
not a commutative language. 


6.4 Disambiguation 


Example 6.4.1. It has been shown that any rational function is definable by some 
unambiguous transducer. Moreover there is for transducers a disambiguation algo¬ 
rithm which yields an unambiguous transducer with an exponential number of states. 
Let E = {a, b}. We show in Figure 6.6 an example of an aperiodic automaton for 


which the algorithm described in [FGR12] never yields an aperiodic automaton. 

As you can see this automaton is aperiodic, yet not counter-free. For state 0 
there are two possible transitions reading letter a. If we choose (0, a, 1) <$ (0, a, 3) 
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then in the disambiguated automaton we have 


( 0 , 0 ) ^ ( 2 , 0 ) 

If and only if k is odd. Similarly if (0, a, 3) <$ (0, a, 1) then 

(0,0) ^ (4,0) 

If and only if k is odd. In both cases the obtained automaton is periodic. 
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Conclusion 


Using techniques inspired by [ Cho03 , lRS91j . we have been able to extend decidability 
results for varieties of rational languages to varieties of rational functions. In the 
deterministic case the notion of minimal DFT gives a natural way to decide if a 
subsequential functions belongs to a given variety. In the non-deterministic case we 
need to consider not one minimal machine, but a finite set of canonical bimachines 
and if one of these machines satisfies the desired algebraic property, then the function 
it defines belongs to the corresponding variety. 

Some of the questions that remain are for instance the complexity of the decision 
procedures described in this report. The question of the equivalence between V-NFT 
and unambiguous V-NFT for a general variety also remains unanswered. 

We foresee several interesting directions in which the techniques of this report 
can be taken. The first one would be to try to extend the results to the more general 
class of regular functions, defined by two-way transducers jEHni] and streaming 


string transducers |AC10 , or some intermediate class. A second possible direction 


is the generalization to relations, with probably some condition of finite value, for 
instance by bounding the ambiguity of the transducers. Another extension would be 
to consider infinite words and see for instance if some minimization is possible for 
DFTs on infinite words. There is also the domain of weighted automata for which 
common techniques are shared with transducers |FGR14j . 
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